Semi-supervised Learning for Spoken Language Understanding Using Semantic Role Labeling
نویسندگان
چکیده
In a goal-oriented spoken dialog system, the major aim of language understanding is to classify utterances into one or more of the pre-defined intents and extract the associated named entities. Typically, the intents are designed by a human expert according to the application domain. Furthermore, these systems are trained using large amounts of data manually labeled using an already prepared labeling guide. In this paper, we propose a semi-supervised spoken language understanding approach based on the taskindependent semantic role labeling of the utterances. The goal is to extract the predicates and the associated arguments from spoken language by using semantic role labeling and determine the intents based on these predicate/argument pairs. We propose an iterative approach using the automatically labeled utterances with semantic roles as the seed training data for intent classification. We have evaluated this understanding approach using two AT&T spoken dialog system applications used for customer care. We have shown that the semantic parses obtained without using any syntactically or semantically labeled in-domain data can represent the semantic intents without a need for manual intent and labeling guide design and labeling phases. Using this approach on automatic speech recognizer transcriptions, for both applications, we have achieved the 86.5% of the performance of a classification model trained with thousands of labeled utterances.
منابع مشابه
Combining active and semi-supervised learning for spoken language understanding
In this paper, we describe active and semi-supervised learning methods for reducing the labeling effort for spoken language understanding. In a goal-oriented call routing system, understanding the intent of the user can be framed as a classification problem. State of the art statistical classification systems are trained using a large number of human-labeled utterances, preparation of which is ...
متن کاملAn Active Learning Approach for Statistical Spoken Language Understanding
In general, large amount of segmented and labeled data is needed to estimate statistical language understanding systems. In recent years, different approaches have been proposed to reduce the segmentation and labeling effort by means of unsupervised o semi-supervised learning techniques. We propose an active learning approach to the estimation of statistical language understanding models that i...
متن کاملSemi-supervised Semantic Role Labeling Using the Latent Words Language Model
Semantic Role Labeling (SRL) has proved to be a valuable tool for performing automatic analysis of natural language texts. Currently however, most systems rely on a large training set, which is manually annotated, an effort that needs to be repeated whenever different languages or a different set of semantic roles is used in a certain application. A possible solution for this problem is semi-su...
متن کاملA Weakly Supervised Learning Approach for Spoken Language Understanding
In this paper, we present a weakly supervised learning approach for spoken language understanding in domain-specific dialogue systems. We model the task of spoken language understanding as a successive classification problem. The first classifier (topic classifier) is used to identify the topic of an input utterance. With the restriction of the recognized target topic, the second classifier (se...
متن کاملAssessing context and learning for isizulu tone recognition
Prosody plays an integral role in spoken language understanding. In isiZulu, a Nguni family language with lexical tone, prosodic information determines word meaning. We assess the impact of models of tone and coarticulation for tone recognition. We demonstrate the importance of modeling prosodic context to improve tone recognition. We employ this less commonly studied language to assess models ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005